Robust Minimax Probability Machine Regression ; CU-CS-952-03
نویسندگان
چکیده
We formulate regression as maximizing the minimum probability (Ω) that the true regression function is within ±2 of the regression model. Our framework starts by posing regression as a binary classification problem, such that a solution to this single classification problem directly solves the original regression problem. Minimax probability machine classification (Lanckriet et al., 2002a) is used to solve the binary classification problem, resulting in a direct bound on the minimum probability Ω that the true regression function is within ±2 of the regression model. This minimax probability machine regression (MPMR) model assumes only that the mean and covariance matrix of the distribution that generated the regression data are known; no further assumptions on conditional distributions are required. Theory is formulated for determining when estimates of mean and covariance are accurate, thus implying robust estimates of the probability bound Ω. Conditions under which the MPMR regression surface is identical to a standard least squares regression surface are given, allowing direct generalization bounds to be easily calculated for any least squares regression model (which was previously possible only under very specific, often unrealistic, distributional assumptions). We further generalize these theoretical bounds to any superposition of basis functions regression model. Experimental evidence is given supporting these theoretical results.
منابع مشابه
Sparse Greedy Minimax Probability Machine Classification ; CU-CS-956-03
The Minimax Probability Machine Classification (MPMC) framework [Lanckriet et al., 2002] builds classifiers by minimizing the maximum probability of misclassification, and gives direct estimates of the probabilistic accuracy bound Ω. The only assumptions that MPMC makes is that good estimates of means and covariance matrixes of the classes exist. However, as with Support Vector Machines, MPMC i...
متن کاملRobust Minimax Probability Machine Regression Robust Minimax Probability Machine Regression
We formulate regression as maximizing the minimum probability (Ω) that the true regression function is within ±2 of the regression model. Our framework starts by posing regression as a binary classification problem, such that a solution to this single classification problem directly solves the original regression problem. Minimax probability machine classification (Lanckriet et al., 2002a) is u...
متن کاملProbabilistic Random Forests: Predicting Data Point Specific Misclassification Probabilities ; CU-CS-954-03
Recently proposed classification algorithms give estimates or worst-case bounds for the probability of misclassification [Lanckriet et al., 2002][L. Breiman, 2001]. These accuracy estimates are for all future predictions, even though some predictions are more likely to be correct than others. This paper introduces Probabilistic Random Forests (PRF), which is based on two existing algorithms, Mi...
متن کاملNonparametric Classification with Polynomial MPMC Cascades ; CU-CS-955-03
A new class of nonparametric algorithms for high-dimensional binary classification is presented using cascades of low dimensional polynomial structures. Construction of polynomial cascades is based on Minimax Probability Machine Classification (MPMC) [Lanckriet et al., 2002], which results in direct estimates of classification accuracy, and provides a simple stopping criteria that does not requ...
متن کاملA Formulation for Minimax Probability Machine Regression
We formulate the regression problem as one of maximizing the minimum probability, symbolized by Ω, that future predicted outputs of the regression model will be within some ±ε bound of the true regression function. Our formulation is unique in that we obtain a direct estimate of this lower probability bound Ω. The proposed framework, minimax probability machine regression (MPMR), is based on th...
متن کامل